Extracting Relevance from Virtual Investing-Related Community Postings
نویسندگان
چکیده
The rapid growth of online investing and virtual investing-related communities (VICs) has a wideraging impact on research, practice and policy. In this context, this research addresses how information is generated, discussed, and diffused within and across VICs, and how such activities impact market efficiency. Regulators are particularly interested given the potential for fraud and spreading of false rumors. However, understanding information processing in VIC is a challenge given enormity of posted messages. Automated analysis of these messages is primarily complicated by three factors: (a) the amount of irrelevant messages or "noise" messages (e.g., spam, insults), (b) the highly unstructured nature of the text (e.g., abbreviations), and finally, and (c) the wide variation in what is considered relevant information for a given company. We have developed a mechanism relying on commonly occurring terms and a set of classifying criteria to identify: (1)"noisy" messages that bear no relevance to the topic at hand, (2) messages that have relevance to the topic at hand, but do not express an opinion as to the quality of the investment, and (3) messages that are both relevant and express a sentiment about the quality of the investment. To test our mechanism we have collected approximately 3 million messages related to 46 stocks over a 2-year period. Preliminary results show sufficient promise to classify messages and how participants react. Preliminary results show the classifier a classification accuracy of 54% . 1 This research is supported by NSF ITR grant number IIS-0218988
منابع مشابه
Classification of Virtual Investing-Related Community Postings
The rapid growth of online investing and virtual investing-related communities (VICs) has a wide-raging impact on research, practice and policy. Given the enormous volume of postings on VICs, automated classification of messages to extract relevance is critical. Classification is complicated by three factors: (a) the amount of irrelevant messages or "noise" messages (e.g., spam, insults), (b) t...
متن کاملRead-only participants: a case for student communication in online classes
The establishment of an online community is widely held as the most important prerequisite for successful course completion and depends on an interaction between a peer group and a facilitator. Beaudoin reasoned that online students sometimes engage and learn even when not taking part in online discussions. The context of this study was an online course on webbased education for a Masters degre...
متن کاملCompetition among Virtual Communities and User Valuation: The Case of Investor Communities
Virtual communities are becoming a significant source of information sharing for consumers and businesses. This research examines how users value virtual communities and how virtual communities grow and compete with each other. In particular, the nature of trade-offs between network size and information quality, and the sources of positive and negative externalities are examined. We address the...
متن کاملVirtual Community Ventures: Success Drivers in the Case of Online Video Sharing
A recent wave of Internet-related entrepreneurship focused on virtual communities. It produced User-Community-Driven Internet ventures (UCDI-ventures), characterized by (1) user-contributed content, (2) network effects, and (3), an interactive community. Whereas light pole examples such as YouTube, MySpace, or Facebook have received high capital market valuations, many other ventures have faile...
متن کاملLanguage Difference in Virtual Communities in Cyberspace: Blogosphere, Wikis and Social Network Sites
As the Internet population grows worldwide, new population using minor languages keeps joining the Internet. At this point, understanding how language difference affects virtual communities on the web becomes more and more important. As virtual communities, blogosphere, wikis, and social network sites were investigated. I argue that language difference serves as a barrier in the virtual communi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005